Project-Team:CQFD

Project-Team Cqfd

Team, Visitors, External Collaborators

Overall Objectives

Presentation

Research Program

Application Domains

Dependability and safety

New Software and Platforms

New Results

A new characterization of the jump rate for piecewise-deterministic Markov processes with discrete transitions
Estimation of the average number of continuous crossings for non-stationary non-diffusion processes
ClustGeo: an R package for hierarchical clustering with spatial constraints
Change-point detection for Piecewise Deterministic Markov Processes
A sharp first order analysis of Feynman–Kac particle models, Part I: Propagation of chaos
A sharp first order analysis of Feynman–Kac particle models, Part II: Particle Gibbs samplers
Exponential mixing properties for time inhomogeneous diffusion processes with killing
Investigation of asymmetry in E. coli growth rate
Design of estimators for restoration of images degraded by haze using genetic programming
Controlling IL-7 injections in HIV-infected patients
Stochastic Control of Observer Trajectories in Passive Tracking with Acoustic Signal Propagation Optimization
Computable approximations for average Markov decision processes in continuous time
Zero-Sum Discounted Reward Criterion Games for Piecewise Deterministic Markov Processes
Approximation of discounted minimax Markov control problems and zero-sum Markov games using Hausdorff and Wasserstein distances
On the expected total cost with unbounded returns for Markov decision processes
Applying Genetic Improvement to a Genetic Programming library in C++

Bilateral Contracts and Grants with Industry

Bilateral Contracts with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Inria | Raweb 2018 | Presentation of the Project-Team CQFD


	PDF	e-Pub

previous

Home | Next next

next

Section: New Results

Computable approximations for average Markov decision processes in continuous time

In this paper we study the numerical approximation of the optimal long-run average cost of a continuous-time Markov decision process, with Borel state and action spaces, and with bounded transition and reward rates. Our approach uses a suitable discretization of the state and action spaces to approximate the original control model. The approximation error for the optimal average reward is then bounded by a linear combination of coefficients related to the discretization of the state and action spaces, namely, the Wasserstein distance between an underlying probability measure $μ$ and a measure with finite support, and the Hausdorff distance between the original and the discretized actions sets. When approximating $μ$ with its empirical probability measure we obtain convergence in probability at an exponential rate. An application to a queueing system is presented.

Authors: Jonatha Anselmi (Inria CQFD), François Dufour (Inria CQFD) and Tomás Prieto-Rumeau.

previous

Home | Next next

next